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(54) Conditional instruction execution 
in a pipelined processor 

(57) A pipelined digital processor in- 
cludes control circuits which during a 
first processor cycle decode a single 
conditional instruction for controlling 
performance of a specific condition test 
during thenext (second) processor cy- 
cle and decode another instruction 
word during the second processor cycle 
for controlling all processing section 
operations during the subsequent 
(third) processor cycle. A circuit per- 
forms the condition test by comparing 
conditions existing in the digital proces- 
sor during the second processor cycle 
with the specific condition information 
included in the conditional instruction 
for selectively disabling control of at 
least one section of the digital proces- 
sor during the third processor cycle 
depending on the result of the condition 
test. 

As described the processor is a multi- 
plier using Booth's algorithm. The pipe- 
lined sections include a multiplier form- 
ing succcessive partial products, an 
accumulator accumulating those pro- 
ducts, and a rounding circuit to round 
the accumulated product. 



The drawings originally filed were informal and the print here reproduced is taken from a later filed formal copy. 
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SPECIFICATION 
Pipelined digital processor 

5 The invention relates to a pipelined digital processor which is described more particularly as a processor 5 
arranged for conditional operations. 

Stored program control digital computers typically include a memory, input-output circuitry, a controller 
and an arithmetic section. The memory provides a source for a computer program and data to be operated 
on by the arithmetic section. The arithmetic section includes circuits which provide means for manipulating 
10 data in a predetermined manner. The controller provides control signals for regulating timing and transfers 10 
of data to be operated upon. The input-output circuitry provides means for transferring information between 
the computer and external devices. Some operations of the computer may be conditioned upon flags, or 
conditions, existing as a result of prior operations or other events. 
• To increase computational speed, some digital computers are arranged for pipelined operation. In a 
1 5 pipelined operation the arithmetic unit, or section, includes a collection of specialized circuits capable of 15 
working simultaneously but altogether forming a general purpose organization. These specialized circuits 
■ operate independently, each performing a specific task in a general purpose procedure. The pipelined 
operation divides a process into several subprocesses which are executed by the individual specialized 
circuits. Successive ones of the subprocesses are carried out in an overlapped mode analogous to an 
20 industrial assembly line. New operands are applied at the input to the arithmetic section during each cycle. 20 
Different subsections of the arithmetic section perform their tasks in sequential order during subsequent 
cycles A resultant is produced each cycle. Each specialized circuit performs its own task at the cyclic rate. 

Control of a pipelined computer, or processor, presents particularly perplexing problems when operations 
are to be executed conditionally because instructions become stacked up in the pipeline during steady-state 

25 

25 ^Heretofore a pipelined digital processor has been designed to transfer data words and instructions from 
memory to the arithmetic section and a control section in respective pipelined streams. These streams of 
data words and instruction words fill pipelines of circuits within the processor. As long as the processor 
operates normally, the pipelines of information are processed step by step through sections of the processor 

30 '"a problem anses^however, when an operation must be executed conditionally. Typically this operation is 
realized by a conditiona I transfer that causes execution of one of two alternative sequences of one or more 
instructions. Since one of these sequences of instructions is in the processor pipeline when the cond.t.on is 
tested it may be necessary to abort execution of that sequence and start to fill the pipeline for execution of 

35 the alternative sequence. Processing time is lost wheneverthis alternate sequence is invoked. . 35 

This problem is solved in.an exemplary pipelined digital processor arranged for conditional operation. The 
processor includes a source of instructions and data words. An arithmetic section processes one data word 
with another data word through selected processing subsections performing operations according to an 
expression, thereby producing a resultant data word. A destination receives the resultant data word from the 

40 arithmetic section. Control circuits decode a single conditional instruction word for controlling performance 40 
of a specific condition test during a first subsequent processor cycle. The control circuits further decoding 
another instruction word during the first subsequent processor cycle for controlling all processing sections 
operations during a second subsequent processor cycle. The specific cdndition test is performed by a circuit 
which compares conditions existing in the digital processor during the first subsequent processor cycle with 

45 information included in the conditional instruction for selectively disabling control of at least one sect.on of 45 

- the processor during the second subsequent processor cycle. 

An embodiment of the invention will now be described with reference to the drawing, wherein 
Figures 1 and 2 when positioned as shown in Figure 3 form a block diagram of a p.pelmed digital signal 

- processor embodying the invention. 
50 Figure 4 is a timing diagram; 

Figures 5 and 6, when positioned as shown in Figure 7, form a processor function chart; and 
Figure 8 is a processor function chart for a conditional operation. 

Referring now to Figures 1 and 2. there is shown the overall architecture of a pipelined digital signal 

processor. , , c . . „ 

A read only memory 1 00 stores instructions and fixed data words. Instructions are transferred from the 55 
read only memory by way of a common data and control bus 101 to instruction registers IR-C; IR-L,M,N; and 



55 



rwuuniy iiicmuiy uy v»oy uwv w ■ • , (( . . . 

IR-S T Parts of instructions are distributed to the instruction registers. Fixed data words, or coefficient 
words, are transferred from the read only memory by way of the common data and control bus 1 01 to a 
coefficient register 102. The register 1 02 is labelled REG X because the coefficients are identified hereinafter 



60 



60 by the symbol x. . 

A random access memory 105 stores variable data words which may be stored therein either from an 
external source or from the output of the arithmetic section of this processor. The variable data words are 
transferred from the random access memory by way of the common data and control bus 101 to a variable 
data register 1 06. The register 1 06 is labelled REG Y because variable data words are identified hereinafter by 
the symbol y By choice of the user, the random access memory may store coefficients used in place of fixed 65 
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data words as well as the variable data words. 

Registers 1 02 and 106, respectively, store a sequential stream of coefficient words and variable data words 
which are operands applied as inputs to an arithmetic section 110. These sequences of operands are 
processed in a pipeline fashion through a multiplier subsection 112, an accumulator subsection 115 and a 
5 rounding and overflow circuit subsection 116. A rounded output word is produced in a register 118 that is 5 
labelled REG W because rounded output words are identified by the symbol w hereinafter. 

An output selector circuit 1 20 is included within the arithmetic section for choosing as an output word from 
the arithmetic section to the data bus 101 either the variable data word / stored in register 106 or the 
rounded output word w stored in register 118. The rounded output word w is a resultant of some process 

10 performed jay the arithmetic section. The chosen output word can be transferred from either the register 106 10 
or the register 118 byway of the common data and control bus 101 to a writeable destination, such as in the 
random access memory 1 05. 

As previously mentioned, instructions for the digital signal processor are stored in read only memory 100. 
During each processor cycle, shown in Figure 4, a single instruction automatically is read out of read only 

15 memory from a location having an address produced by an address arithmetic unit, or section, 124. The 15 
address from a program counter register PC in the address arithmetic section is applied by way of an address 
bus 1 28 to the address circuitry of the read only memory. Read only memory responds during each 
processor cycle by sending the single instruction thus fetched by way of the common data and control bus to 
the various control field, or instruction, registers IR-C, IR-L,M,N, and IR-S,T associated with different sections 

20 of the processor. 20 
Each instruction, or opcode, used in the digital signal processor includes a plurality of control fields, or 
control messages, each of which is given a designation such as /, m, n f s and f to be used hereinafter. The 
control field register IR-L,M,N associated with arithmetic section 1 10 receives some of the fields, such as 
instruction fields /, m and n, respectively associated with control of multiplying, accumulating and rounding 

25 operations. The control field register IR-S.T, associated with the address arithmetic section 1 24, receives 25 
instruction fields s and f which relate to control of address register modification for controlling the fetching 
of operands x and /and the storing of the output word chosen by the selector circuit 120. 

The address arithmetic section 124 includes two sets of registers 141 and 142, an address bus latch 145, an 
adder 1 47 and an adder latch 1 50 interconnected by some busses. 

30 One set of registers 141, including registers RX, RY, RD, and PC, is arranged to store memory addresses. 30 
An address stored in register RX can be used tor accessing a coefficient word stored in a location in either 
random access memory or read only memory. An address stored in register RY can be used only for 
accessing a variable data word stored in a location in random access memory. An address stored in the 
register RD can be used for writing a resultant data word into a destination, such as a location in random 

35 access memory. Ah address stored in the program counter register PC is used for accessing the next 35 
instruction or fixed data word from the read only memory. 

The second set of registers 142 is arranged to store variable increment values to be used for incrementing 
automatically addresses stored in registers RX, RY and RD. Alternatively, the stored addresses may be 
incremented by one of a set of fixed value increments. 

40 Operations of the digital signal processor are controlled by two types of instructions. Normal instructions 40 
are used most of the time. They control the performance of arithmetic operations during signal processing. 
Another type of instruction, used occasionally, is called an auxiliary instruction. One specific auxiliary 
instruction controls the loading of an address register or an address increment register in the address 
arithmetic section. 

45 It is assumed that a start up sequence of instructions is stored in the read only memory starting at an initial 45 
address and that a reset circuit sets the program counter register PC to the initial address. Following the reset 
operation, typically there is a sequence of instructions for storing additional addresses in the address 
registers RX, RY and RD and increment values in increment registers Rl, RJ and RK. These regsters are set by 
auxiliary instructions. Ordinarily the value stored in the registers Rl, RJ and RK are retained therein 

50 throughout a program while the values in the registers RX, RY and RD are modified from time to time during 50 
the execution of a sequence of normal instructions. 

After the processor is reset and the address and increment values stored, the processor can run a valid 
program for processing digital signals. Most of the instructions used for processing signals are normal 
arithmetic instructions. 

55 Information in each of the registers RX, RY, RD, PC, Rl, RJ and RKcan be set to any specific value by an 55 
auxiliary instruction. For example, a first instruction to load address register RY specifies that some 
processor register is to be loaded or set. 

In this first instruction, a control field c contains the required information. This control field c is stored in an 
instruction register IR-C during the instruction fetch cycle. 

50 A fixed data word, associated with the first instruction and loaded into the address arithmetic section 1 24 60 
during the processor cycle in which that instruction is decoded, provides information identifying which 
address register is to be loaded and fixing the increment value to be loaded. The control field and value field 
are transferred from memory by way of the common data and control bus 101 to the control field register 
XSR and the value field register XSL. 

65 While the first instruction is being executed, the control field in the register XSR is decoded in a decoded 65 
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1 57 to select the proper address register. From register XSL the value to be loaded into the address register 
RY is applied to the registers 141 and 142 through a selector circuit 158 and a bus 160 in the execution cycle 
of the first instruction. m . 

A second instruction to load increment register Rl specifies that a processor register is to be loaded or set. 
5 As in the just described example of setting the address register RY, a fixed data word similarly associated 5 
with the second instruction provides a control field to identify the register to be set and a value field to 
establish the value to be loaded. The fields of the fixed data word are applied from the register XSR through 
the decoder 157 and the bus 137 to determine the increment register selected in the set of registers 142 and 
from the register XSL through selector 158 and bus 160 to establish the value to be loaded in the selected 
10 increment register during the execution cycle of the second instruction. 10 
During the processing of both normal and auxiliary instructions, control fields s and t from the instruction 
are stored in the instruction register IR-S,T when that instruction is fetched. These fields are decoded , n a 
decoder 1 52 during the next processor cycle with the decoded information being latched in an AAU control 
*circuit154 This decoded information is applied over a bus 135 to the sets of registers 141 and 142 during the 
15 instruction execute cycle, or second processor cycle, after the fetch. Both an address register and an .15 
increment register or a fixed increment are selected by the information on bus 1 35. The address is applied to 

- the address bus latch 145 and to the input of an adder 147. The increment value simultaneously is applied to 
the other input of the adder 147, which increments the address and stores it for one machine state in an 
adder latch 1 50. During the following machine state, the incremented address is applied by way of a bus 136 

20 to the set of address registers 141. . ., . 20 

Simultaneously during the processing of a normal instruction, part of the information in the fields s and f is 

applied through a single machine state delay in a delay circuit 155. This delayed information provides 

selection information for determining which of the address registers 141 is to be written after the just 

described addressing operation. In the following machine state, the delayed information is decoded in a 
25 decoder 157 and applied over a bus 137 to the address registers 141. At this time, the incremented address 25 

stored in the adder latch 150 is written into the selected address register thus post modifying the address. 
During the processing of an auxiliary register set instruction, the above described operation for writing a 

post modified address back into an address register may be preempted by the register set operation. 

Preempting is accomplished by the decoder 157 in response to information applied thereto from logoic 
30 circuit 122 by way of a path 138, AAU control circuit 154 and delay circuit 155. 30 
When the register set instruction preempts the writing of an address register, the information for selecting 

the address register is applied from register XSR through the decoder 157 and bus 137 to the address 

register set 141 . Simultaneously from the register XSR, information is applied through decoder 1 57 and bus 

137 for selecting information on bus 160 in lieu of information on bus 137. 
3B The address arithmetic section 1 24 transmits addresses by way of the address bus latch 145 for accessing 35 

locations in memories 1 00 and 105, generates new addresses in the adder 147 and sets the address registers 

RX, RY, RD and PC. 

Referring now to Figure 4, the diagram shows that addresses are transmitted to memory as a series of four 
addresses being transmitted during each processor cycle. One of the addresses is transmitted dur.ng each of 
40 four machine states during each processor cycle; The first address transmitted during the first machine state 40 
is the address stored in the program counter register PC. As indicated in Figure 4, this address is transmitted 
automatically during the first machine state of each processor cycle. The second address transmitted dur.ng 

- the second machine state is the address stored in register RD or in register RX. The third address transmitted 
during the third machine state is the address stored in register RX or in the program counter register PC. The 

45 fourth address transmitted during the fourth machine state is the address stored in register RY. . 45 

- Each address transmitted by the address arithmetic section is latched in the address bus latch 145 dur.ng 
the mentioned machine states of the processor cycle. Also during those machine states, the addresses are 
incremented in the address arithmetic unit adder 147 by an increment value that is read out of one of the 

- increment registers Rl, RJ and RK or in the case of the address from register PC, the address is incremented 

50 by + 1. These incrementing operations are accomplished during the same machine state that the address is 50 

Identification of the selected address and increment registers is accomplished by applying the appropriate 
control fields to the instrument register IR-S, T prior to the addressing operation so that the appropriate 
coding is applied to access circuitry for both the address and the increment registers during the machine 
55 state that the address is to be transmitted. Both the address and the value of the increment are read out and 55 
are summed by the adder 147. The resulting incremented address is stored in the adder latch 150 wh.le the 
address is being transmitted from the address bus latch 145. - - C c 

Coding for identifying whichever address register was selected is transferred through the delay circuit 1 55 
to the decode register circuit 157. Delay and decoding are designed so that the incremented address stored 
60 in the adder latch 150 can be written into the address register from which the transmuted address was 

fetched. Thus the transmitted address is post-modified or post-incremented during the processor cycle when 
it is transmitted to the memories 101 and 105. 

Turning now to Figure 2, the arithmetic section 11 0 is organized for pipelined operations. Coefficients 
words x and variable data words y are operands received from the memories by way of the common data 
65 and control bus 101 into coefficient word register 102 and the variable data word register 106. The rounded 
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output words walso" are operands for some operations and are stored in the register 1 18. A new operand is 
received into each of those registers during every processor cycle of a normal instruction. 

The arithmetic section 110 includes three subsections which are independently controllable in response to 
different control fields /, m and n. During the fetch cycle of an instruction, the fields /, m and n are stored in an 
5 instruction register IR-L,M,N. In the next processor cycle, those fields are decoded in a decoder circuit 113 5 
and the result stored in register REG F. During the following processor cycle, this information is transferred 
to an AU control circuit 1 14 for supplying control signals to various subsections of the arithmetic section. 
This latter processor cycle is the execution cycle of the instruction. The control signals provide information 
relating to which choices are to be made from processing options available in each of the subsections. The 
10 multiplier subsection 112 typically generates a product of two operands during each processor cycle. In a 10 
typical multiplication, one operand is the coefficient word x and the second operand is either the variable 
* data word /or the rounded output word w.. 

Coefficient word x is a 16-bit word. These sixteen bits are taken into the register 1 02 from the most 
significant bit lines of the common data and control bus. A selection circuit 1 62 scans the sixteen bits of the 
15 coefficient word, from the least significant bit to the most significant bit, four bits at a time during each of the 15 
four machine states in every processor cycle- Another selection circuit 1 63 concurrently selects either a 20-bit 
variable data word y or a 20-bit rounded output word w. 

Multiplication based on Booth's algorithm is performed. Thus a Booth logic circuit 165 responds to the 
successive 4-bit nibbles to produce control signals for the generation of partial products. 
20 The output from the Booth logic circuit 1 65 during every machine state is latched into a register 1 66. This 20 
output is applied to a circuit 168 which produces the partial products by data selection. 

These partial products are accumulated by adding to prior sums and carries. An adder 170 sums the partial 
products with the prior sum and carry information storing a resulting 36-bit intermediate operand, or 
product word p, in a product register P. Associated registers S and C respectively store the sum and carry 
25 information produced during each processor cycle. .25 
Because the arithmetic section is arranged for pipelined operation, the product register P receives a new 
intermediate operand, or product word, p during every processor cycle of normal instructions. This product 
wordp is applied by way of a bus 172 as an intermediate operand to the input of the accumulator subsection 
115. 

30 In the accumulator subsection, the product word p is added with a 40-bit resultant output word a that may 30 
be shifted by a circuit 174 prior to application as.an input to an adder circuit 175. The adder circuit 175. 
produces sum and carry information which is stored in register 177. The sum and carry information is stored 
in register 177 during every processor cycle. Carries are resolved by carry-look-ahead logic in adder 178. 
Output from adder 1 78 is applied to an input of a logic circuit 1 80 together with the resultant output word a to 

35 generate the next subsequent value of the 40-bit resultant output word a to be stored in register A. Such a 35 
resultant output word is produced and stored in register A during each processor cycle of a normal 
instruction. 

A portion of the resultant output word a is applied as an input to the rounding and overflow circuit 
subsection 116 in 10-bit slices. These slices are clocked through a rounding circuit 182 and an overflow logic 

40 circuit 1 84 to the 20-bit rounded output register W in three consecutive machine states of each processor 40 
cycle. In the fourth machine state, the value in the register W may be corrected for overflow if the value in the 
register A is too large to be represented in the 20-bit register W ; Then the rounded output word can be 
transferred through the common data and control bus 101 to a destination, such as a location in the random 
access memory 105 where it is stored. 

45 The three subsections (multiplier, accumulator and rounding) of the arithmetic section accomplish their 45 
basic operations in one processor cycle each. Outputs of the subsections are stored in registers every 
processor cycle so that the next subsection in line has a stable input to commence the next subsequent 
processor cycle. 

Control of the arithmetic section 110 and of the address arithmetic section 1 24 is accomplished by a 
50 pipelined stream of instructions applied from the memory 100 through the common data and control bus 50 
101. As previously stated with respect to Figure 4, a single instruction is read out of memory during each 
processor cycle of operation. Such an instruction includes several instruction fields, or control messages, /, 
m, n, s and f. Fields /, m and n are transferred through the common bus 1 01 to the register IR-L, M,N for 
controlling the subsections of the arithmetic section 110. Fields s and fare transferred through the common 
55 bus 101 to the register IR-S.T for controlling selection and incrementation of addresses stored in the registers 55 
RX, RY, RD and PC. 

A fuller appreciation of the arrangement for and operation by pipelined control of processing may be 
achieved by the following discussion of a specific example of operation. 

60 Normal operation 60 
A complete normal assembly language instruction includes all of the information required to perform a 
desired arithmetic operation. Assembly language instructions for the digital signal processor are designed to 
represent the control for access to the memory and the control for operation of the arithmetic subsection and 
of the address arithmetic subsection. The arithmetic subsection continuously performs multiplication and 

65 addition operations. The normal arithmetic section operations are characterized by the following general 65 
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expressions: 



xf(y) + f a (a) — a{ — w} 

x-f(w) + f a (a) — a{-* w}, where 



10 



15 



25 



describes a'function of either the data word y or the rounded output w. Such function can be the actual 
value, the sig n, or the absolute value of either one of the variables y or w. 



processor^ ^ ^ products p . x f(y ) or p = x f{w) is formed and is stored in the product register P located at 



40 



45 



55 



60 



p^-xf(y) 
{w^a} a«-p + f a (a) or 

p <— x-f(w) 



Then as the reader proceeds from left to right, the proper values of the product word p and of the contents a 
of the accumulator are more apparent. The proper values are the results of the last preceding 0 P^ atl f n 
which determined those values. Thus the value of the contents a of the accumulator to be rounded into the 
rounded output w or to be used in any fu nction f .(a), is the contents a of the accumulator at the end of the 
65 last previous accumulation. Similarly, the value of a product word p to be used in a current accumulate has 



5 



is a 1 6-bit wide coefficient word usually fetched from read only memory. The coefficient word x also 

could be fetched from random access memory or from an input/output circuit 200 and ordinarily has a 10 

value for all arithmetic operations. 

is a 20-bit wide data word normally fetched from random access memory. Such a data word also could 
be fetched from the input/output circuit 200. 



15 



a represents the 40-bit wide contents of an accumulator register A. In the accumulator register A, the 
least significant thirty-six bits are used to accumulate the product of a 1 6-bit by 20-bit multiplication. 
The four most significant bits provide overflow protection for the accumulation operation. 

20 w is a 20-bit wide rounded or truncated output of the accumulator. The least significant bit of the 20 
rounded output w corresponds with the bit that is fourteenth from the least significant bit of the 
contents a of the accumulator. This correspondence of bits is consistent with an assumption that the 
data word y and the rounded output w are integers and that the coefficient word x usually is restricted 
within the bounds -2 x < 2. 



25 



f a generally describes a function of the contents a of the accumulator, such as a, -a, 0 2a, etc. 
30 The variables x, y, w and p are contained in arithmetic section registers X, Y, W and P, respectively. 30 

The aforementioned general expressions imply that three operations are to be performed by the 



35 



40 



(1) 

35 the output of a multiplier. . 

(2) An accumulation of a resultant word a = p + f a (a) is accomplished in the accumulator. 

(3) Then if required, the resultant word a of the accumulator is rounded and the rounded output word w 
is written into the rounded output register W. 

Each of these three operations is completed during one processor cycle of the digital signal processor. 
Typically during the operations, the coefficient word x has a value and a multiplication forms the product p. 
Also typically during each cycle; all three types of operations are performed concurrently by different 
subsections of the arithmetic section. For some instructions one or more of the three operations may not 
occur. The operation performed by one subsection during one processor cycle is a partial evaluation of a 
different general expression than the expressions concurrently being evaluated in the other subsections. 
Assembly language instructions are converted to machine language instructions which are stored in the 45 
" memory for actually controlling the digital signal processor. Because the operations are dependent upon 
one another and because all of the operations occur concurrently within the processor, it is important to 
know at all times what is stored in various registers and what operation is to be performed thereupon. 
" To avoid confusion regarding which values of the product word p and which values of the contents a of the 
50 accumulator are involved in any processor operation, the following order of operations is recommended 50 
when writing assembly language expressions representing them. 
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a value determined in the last previous multiplication operation. 

Because of the reasons given in the foregoing discussion of the order of processor operations, it is 
important that the information contained in the assembly language instruction be presented to the processor 
in proper order. Information presented in the following order is acceptable to the processor. 
5 (1) A choice of destination is made. The word to be written to the destination is chosen from either the 5 
rounded output word w or the data word y. The chosen word can be written into the random access memory 
or into the input/output circuit. The specific destination of the selected word is given. 

(2) As required by the instruction, there is a choice of whether or not to move the resultant word a into 
the rounded output w. 

10 (3) One accumulation operation is selected from a group of operations having a general expression 10 
a = p + f a {a). 

(4) Specify a multiplication operation producing the product p = x-f(y) by indicating the source XSRC of 
the coefficient word x, the nature of the function f, and the selection of the data word y, together with the 
source YSRC of the data word y. Alternatively specify a multiplication operation producing the product p = 

15 x-f(w) by indicating the source XSRC of the coefficient word x, the nature of the function f, and the selection 15 
of the rounded output w rather than the data word y. 

The following exceptions apply to the above-mentioned teft-to-right rule. When the rounded output w is 
selected for the multiplication, the value of the rounded output w is the value determined by the last 
rounding of the resultant word a as performed in a preceding instruction. If data word y is to be written and a 

20 source for data word y specified, the first step in execution of the instruction moves the data from the 20 
specified source into the data register Y. Thereafter any writing of this new value for data word y can occur. 

The following Table I summarizes the normal assembly language instructions that a programmer would ' 
use for preparing an assembly language program. The syntax of a language called C is used as the assembly 
language which is described in the text entitled. The C Programming Language by B. W. Kernighan et al, 

25 Prentice-Hall, Inc., 1978. Each complete instruction is formed by choosing four statements, one statement 25 
from each column of Table I starting with the lefthand column and working toward the right. In the two 
leftmost columns, the word NOTHING is listed as a valid choice. When the word NOTHING is selected as a 
part of a complete instruction, the corresponding space in the instruction is left blank. Every complete 
assembly language instruction is terminated by a semicolon. 

30 30 

TABLE I 



Normal assembly language instructions 



35 


NOTHING NOTHING 


a=p 


p=XSRC*YSRC 


35 




DEST=y w=a 


a=p-*-a 


p=XSRC*w 




40 


DEST=YSRC 
DEST-w 


a=p-a 

a=p+2*a 

a=p+8*a 


p=XSRC*c 
p=XSRC*abs(YSRC) 
. p=*XSRC*abs(w) 


40 


45 




a=p+a/2 " 
a=p+a/8 


p=XSRC*c # sgn{YSRC) 
p=XSRC*c*sgn(w) 


45 


50 




a=p+a 




50 




In Table I the symbol DEST means a destination statement and is to be replaced in the assembly language 
instruction by the following statements. 




55 




DEST 
*rd++i 




55 


60 




*rd ++j 




60 



*rd + -rk 



obuf 
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where e.g., *rd+ +i means that rd, which is the address of the location in random access memory pointed to 
by the contents of register RD, is post incremented by (he contents / of register Rl. Interpretations of different 
ones of the foregoing destination statements are explained in more detail subsequently. 

Also in Table I, the symbol XSRC means a statement for the source of data word x, and the symbol YSRC 
means a statement for the source of the data word y. Each of those two symbols in Table I is to be replaced, 5 
in any assembly language instruction, by one of the statements in the following two columns: 

XSRC YSRC 

x(oldx) *ry++' 10 

VALUE (immediate x) *ry++j 

*rx+ + i *ry+ + k 

*rx++j ibufy 
*rx++k 
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*rx 
*rx- 
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15 



20 



25 



ibufx 

*(rom+rx+ + i) 
*{rom+rx++j) 

*{rom + rx+ + k) 30 
& LABEL 

35 In the foregoing columns headed XSRC, the symbol VALUE represents a number that appears as an 35 
argument of an instruction, i.e., the 1 6-bit word immediately following the opcode in the read only memory. 
Such argument is addressed by the address stored in the program counter register PC. In other fetches from 
memory, the coefficient word x is addressed by the contents rx of the register RX. The notation *(rom+rx...) 
is used to indicate that the contents rx of the register RX point to read only memory rather than the random 

40 access memory. The symbol & LABEL represents that the value read from memory source x is an address 40 
associated with a label in the program. Other expressions presented in the foregoing two columns for the 
sources of the coefficient word x and the data word y are presented in more detail subsequently,. 
- With respect to forming complete assembly language instructions from the information presented in Table 
I some caution is suggested. If the expression DEST = YSRC is desired in an instruction including the 

45 expression YSRC from the rightmost column, then the expression DEST = y must be used in place of the 45 

* expression DEST = YSRC. If the rounded output wis to be used in the rightmost column, the expression 

DEST = YSRC cannot be selected from the leftmost column. Additionally, NOTHING should be selected from 

the leftmost column when the assembly language instruction is a normal instruction in which the source of 

" the coefficient word x is located in random access memory. 
50 ' in the preparation of a program, a programmer will first write out a series of general mathematical 50 

expressions or operations desired to be performed. These may take, for example, the form 



55 



x-f(w) + f a {a) a{-> w} 

Such a general mathematical expression is translated by the programmer into an assembly language 
statement which takes the following form: 

60 * " m U 60 

*rd++j=w w=a a=p+a p=«rx++i"ry++k; 
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15 



where 

/ means an instruction field for controlling the formation of a product. 
m . means the instruction field for performing an accumulation. 

n means an instruction field for controlling a transfer operation from register A to register W with the 
required rounding. 

s represents an instruction field identifying a write destination. In this example the destination is a 
memory location specified by the address stored in register RD. That address is post-incremented and 
the result stored in the register RD, 

t means an instruction field to fetch information from an address stored in an address arithmetic unit 
register, post-increment that address and store it back into that same register. 

The next step performed by the programmer is to skew in time the assembly language statement as 
follows: 



10 



15 



20 



25 



Time 
i 

i+1 
i+2 
1+3 



n 



m 



p=*rx++i**ry++k; 



a— p+a 



w=a 



*rd++j-w 



20 



25 



30 The resulting skewed assembly language statement, which appears diagonally on the time line of the 
leftmost column, is stated together with skewed assembly language statements representing other general 
mathematical operations. When these skewed assembly language statements are stated together, the 
resulting pieces of different statements which appear in the same row, or during the same interval such as 
interval /, form an assembly language instruction. In the assembly language instruction, the different pieces 

35 of information in the same interval are separate fields of that assembly language instruction. Each of these 
fields controls a separate subsection of the processor for performing a step in the process of evaluation, as 
described by a portion of one of the general mathematical expressions. 

An assembler program, which runs on a general purpose computer, operates on each assembly language 
instruction by moving the source fields two processor.cycles earlier in the program than the rest of the fields 

40 in that same assembly language instruction. This moving of the source fields is done to every assembly 
language instruction in the program. The resulting time line for the foregoing assembler statement, as 
skewed by the programmer and the assembler will appear as follows: 



30 



35 



40 



45 



Time 
i-2 
i-1 



m 



x=*rx+ + i y=*ry+ + k 



/ 



45 



50 



55 



i 

i+1 

i+2. 

i+3 



p=x*y 



a=p+a 



w=a 



50 



55 



*rd++j=w 



Referring now to Figures 5 and 6, there is shown a time line diagram indicating how data is processed in 
60 the digital signal processor. In general, the diagram presents the flow of data through various subsections of 60 
the processor during the evaluation of one general mathematical expression together with parts of other 
mathematical expressions. 

Before attempting to describe the operations represented we will first define symbols used throughout the 
time line diagram of Figures 5 and 6. 
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[. j s a machine language instruction fetched from read only memory during a processor cycle, or 

interval, /and decoded within the processor during a processor cycle, or interval, i+1. In general 
the instruction 1, affects operation of sections of the processor during a processor cycle, or 
interval, 1 +2. As previously mentioned each instruction contains the fields, or control messages, 
5 /, m, n, s and f. 

I r (t) represents the field t in the machine language instruction I; for controlling the fetching of 

operands x i+3 and y^a. These fetches take place during the interval i+3. 

represents the field / in the machine language instruction lj for controlling the computation of a 
product, or intermediate operand, p i+2 during the interval i+2.The product p i+2 is a function of the 



10 MD 



•Mm) 



xi and 



40 



45 



.ai+2 



50 



prod 

operands Xi+i and y,+i- 



Ph-i represents the product formed by the multiplier during the interval This product is an 

intermediate operand which is used as an input to the accumulator for its operation occurring 
during the interval i+2. Product pi +1 is formed in register P under control of the field 1^(1 ). The 
multiplier and multiplicand are the operands xj and y T . 

represents the contents of the accumulator during the interval i+2. This is the desired resultant 
word a i+2 for the expression being evaluated. The word a i+2 is an input for the rounding and 
output circuit subsection for the interval i+3. The rounding operation occurs under the control of 
the field l i+1 (n). 

represents rounded output word w which is available in the register W and which can be stored 
into writeable memory during the interval i+4 under the control of the field l i+2 (s). 



10 



,, iX ,.„ represents the field m in the machine language instruction I, for controlling the accumulation of 
■J5 output from, or desired resultant word, a i+2 during the interval i+2. The resultant word, ai +2 is a 15 

function of the last prior resultant word a (+ i and a product p i+ , previously computed. 

Ii(n) is a field n in the machine language instruction for controlling the transfer of a rounded output 

word w i+2 during interval i+2. Rounded output w i+2 is a function of the last prior rounded output 
20 Wi+1 and the resultant word a i+1 of the accumulator. 20 

li(s) is a field in the machine language instruction for controlling the storing of the rounded output 

word w i+1 and the modification of register stored addresses u i+2 during the interval i+2. The 
modified addresses are a function of the prior address u i+ i and field Ms). The updated memory 
25 state M i+2 is a function of the field l,(s), the prior memory state M i+1 , register stored addresses u i+1 25 

and the rounded output word w i+1 . 

| (s t) is a combination of fields 5 and t within the machine language instruction. The fields control the 
modification of register stored addresses u i+2 during the interval i+2. The modified addresses 
30 u i4 . 2 are also a function of the address u i+1 . ^° 



are operands fetched from memory during the interval /, under control of the field t of the . 
instruction li_ 3 fetched from memory during the interval i-3. Instruction l,_ 3 is decoded during 
35 interval i-2 and controls processing during interval i-1 wherein the addresses for operands X; 35 

and y f are produced. As previously mentioned these operands are accessed from memory during 
interval /.They are processed through the multiplier during the interval i+1 under control of the 
field / of the instruction which is fetched during the interval if.,. This produces the 
intermediate operand or product pi+v 



40 



45 



50 



55 



55 In the diagram of Figures 5 and 6, there is shown all of the processing activities of various processor 
subsections of the digital signal processor together with time in processor cycles. Each column in the chart 
represents a different processor cycle, or time interval, of the processor. Information in each column is 
- closely related to some machine language instruction. Each row represents activities of a different processor 
subsection performing its assigned functions during operation of the digital signal processor. 

60 Since each row bf.the chart represents a different activity, we shall define those activities. The first row 60 
below the processor cycle headings indicates storage activities, i.e., memory fetches and stores. The second 
row presents the times at which instructions are decoded within the digital signal processor. The third row 
shows the computing of the product p by the multiplier subsection of the processor. The fourth row presents 
the accumulating of the resultant word a by the accumulator subsection of the processor. Row five presents 

65 activities of the rounding and overflow subsection of the processor, which produces the rounded output 65 
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word w. The sixth row discloses activities associated with modifying addresses used for fetching data for the 
arithmetic processes. 

The processing of the aforementioned general arithmetic expression can be traced through the various 
sections and subsections of the digital signal processor by reference to Figures 5 and 6. 
5 A first step in the processing of a general arithmetic expression is the fetching of operands for a 5 
multiplication. As previously mentioned, information relating to this fetch operation is placed by the 
assembler program into an interval earlier than the information associated with control of the multiplication 
operation. As a result of this assembler program function, every machine language instruction includes a 
control field for a fetch operation that fetches information from memory for processing to be controlled by a 

10 subsequent machine language instruction. -jq 
As an example of processing an instruction, consider processing a general expression having information 
relating to fetch operations for its operands included within an instruction fetched during the interval i-3 of 
Figure 5. This instruction lj_ 3 is shown in an emphasized box and is labelled with a subscript identifying the 
instruction as the instruction fetched during interval i-3. Each instruction shown in the processor function 

15 chart is similarly designated in accordance with the interval during which the instruction is fetched from 15 
memory. Alsoieach instruction, shown in Figures 5 and 6 includes several fields of control information. Each 
of those fields /, m, n, s and rare shown in parentheses associated with the instructions in the first row 
representing the fetching and storing operations. A separate field or separate fields of an instruction are 
shown in other rows of the chart, e.g., I; (I} in the row for computing products and lj (s, t) in the row for 

20 modifying addresses. - 20 

During the interval i-2, the just fetched instruction li_ 3 is decoded by the processor, as shown in the 
emphasized box in the second row representing the decoding of instructions. 

A fetch operation for the operands x and y, identified by the instruction l;_ 3 , begins during the interval i— 1. 
The fetch operation begins using an address specified in the instruction field lj_ 3 (t). When that address is 

25 used, it is modifecl and stored back in the address arithmetic section as a function of the instruction field 25 
li_ 3 (s,t} and the prior state Uj- 2 of the registers in the address arithmetic section. This modification of 
addresses is shown in the emphasized box under the interval i— 1. Fetch of those operands x and vis 
concluded during interval / when the specific operands x ; and y;, identified by the instruction lj_ 3 , are read out 
of memory and are transferred by way of the common data and control bus respectively to registers REG X 

30 and REG Y. These fetch operations are shown in the emphasized box under the interval /. The operand x-, 30 
typically is read out of read only memo ry, and operand y ; typically is read out of random access memory. 

The address pointers, or the addresses stored in registers RX and RY, which were updated in the prior 
interval i-1 are used for accessing the operands from memory during the interval /". 
The first arithmetic operation to be performed on the operands x t andy ; occurs during interval i+1. At this 

35 time the multiplier subsection responds to the instruction field li-Td) for computing an intermediate 35 
operand, or product, pi+i, as shown in the emphasized box under interval Such product Pi*i is shown as 
a function ofthe operands x; and y ; and of the instruction field lil^D. 

Instruction which includes the field lj_t{1), is fetched from memory during the interval i-1, is decoded 
during interval /and controls sections of the processor during interval i+1. 

40 The next step in evaluating the general expression is processed in the accumulator during interval i+2. 40 
This is shown in Figure 6 in the fourth row representing the accumulation of the resultant word a in an 
emphasized box under the column designated interval i+2. A resultant word a, i2 is shown to be a function of 
the prior resultant word a i+1 stored in the accumulator, the just described intermediate operand, or product, 
Pi+i, and of the instruction field Urn). 

45 If specified by the programmer and after the result is accumulated during interval i+2, that result is 45 
rounded and is stored in the rounded output register W. This rounding operation is shown under the interval ' « 
i+3 in an emphasized box in the fifth row representing rounding of the output. The specific rounding 
operation occurs during interval i+3 where the rounded output w i+3 is shown as a function of the last prior 
rounded output W|* 2 of the rounded output register W, the just described resultant word a^ 2 of the 

50 accumulator, and of the instruction field l j+1 (n). 50 
A final step in processing the general expression is a writing of the rounded output Wj* 3 into memory 
during interval i+4. This is shown in the emphasized box in the first row of the chart under the interval i+4. 
Writing a new memory state M i+4 is a function of the memory state Mj* 3 for interval i+3 of the last prior 
address register state u i+3 , of the last rounded output w } . 3 just described, and of the instruction field lj. 2 (s) 

55 which was fetched during interval i+2 and decoded during interval i+3. 55 
Rounded output w r+3 contained in the rounded output register at the end of interval i+3 is transferred by 
way of the common data and control bus either to the random access memory or to a buffer in the 
input/output circuitry during interval i+4. 
At the same time that the memory write operation occurs during interval i+4, the address arithmetic 

60 section registers are updated based on information in the instruction fetched during interval i+2. The 60 
information used is included in the fields l^ 2 (s,t) of the instruction l^ 2 that is fetched during interval i+2 and 
is decoded during interval i+3. 

During the interval i+2, it is noted that the instruction Ij which wasfetchecd during interval /controls the 
multiplier subsection, the accumulator subsection and the rounding and overflow subsection of the 

65 arithmetic section. This results from the instruction lj being fetched in interval /, decoded in interval i+1 and 65 
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'used for control during interval i+2. No parts remain for controlling subsections of the arithmetic section 
during subsequent intervals, as in prior pipelined control arrangements. Most of the column representing 
interval i+2 is emphasized with heavy lines so that the reader readily can find several fields of the instruction 
h for controlling subsections of the arithmetic section during interval i+2. . -~ ' 

5 Operands for the multiplier operation were fetched during the interval i+1 which follows interval /- The 5 
resulting product p i+2 is formed during the next interval i+2. 

A resultant word a i+2 which is formed during that same interval i+2 is a function of an earlier resultant 
word a i+ i and an earlier produce p i+1 . This resultant word a i+2 is a resultant word evaluated for a different 
general expression than the general expression being evaluated by forming the product p i+2 . This concept 

10 can perhaps be better understood by the realization that the emphasized boxes forming a diagonal from the 10 
top of the column designated processor cycle # down to the fifth row in the column designated processor 
cycle i+3 relate to the evaluation of one general expression. A similar diagonal, shifted one interval to the 
right in each column, relates to the evaluation of another different general expression. 
Typically in a signal processing progam, instructions are executed, in sequence, up to a point where the 

1 5 Vog ram counter PC, is set to the address value in the program store which is the location of the instruction 1 5 
of the beginning of the sequence. Thus the program will operate continuously in a loop executing the same 
sequence of instructionss repeatedly. Furthermore, fixed data words will be stored at memory locations 
"where addresses are interleaved with location of instructions in the program sequence. In this way, as 
shown in Figure 4, the address in the program counter register PC is used to address a fixed data word 

20 during state 2 of processor cycle i+1 . The program counter then is incremented by the fixed increment +1 or 20 
is used to address an instruction, i, +2 , in state 0 of processor cycle i+2. Again the program counter is 
incremented by the fixed increment, + 1 , and used to address the next fixed data word in state 2 of processor 
cycle i+2 Continuing, the program counter is incremented by +1 and is used to address instruction, l i+3 . m 
state 0 of the processor cycle i+3 and so on until the end of the instruction sequence. At that time the 

25 program counter is set, by an auxiliary register set instruction, to the address of the first instruct.on in the 25 

sequence. , 

To this point in the description, only routine normal operations of the digital signal processor have been 
mentioned. Other operations, such as conditional operations, can be performed by the pipelined digital 
signal processor. 

Conditional operation _ 

In many cases, the algorithm, realized by the condition test and execution of alternative operations that are 
dependent upon the outcome of the test may be realized as well by a sequence of one or more instructions 
that either is executed or is not executed. If this sequence is short, the overall savings in processing time may 
35 be large whem compared with using the conditional program transfer technique of the prior art for achieving 35 
the same result. We have discovered that conditional operations occurring in digital signal processing 
frequently can be realized with the use of a sequence of one or more instructions that either is executed or 
not. Therefore the digital signal processor has been designed to process efficiently conditional operat.ons in 
this manner. The concept used, however, has wider applicability to digital processors generally. 
40 For example consider the problem of finding the maximum value of a sequence of samples stored in 40 
memory. The value of each sample in the sequence can be compared with the value of a word in another 
location by using the conventional conditional transfer approach requiring alternative sequential processing 
paths. 



30 



45 



if(x>X ma x) 

50 , x max =x ' 

else 

/*do nothing*/ 

55 



45 



50 



55 



60 



such that the statement x max = x; is bypassed or branched around by conditional change of the program 
60 counter contents. In our digital signal processor, the instruction x max = x is processed (i.e., fetched and 
decoded) in sequence independent of the test with the actual transfer of data to x max being inhibited if the 

test fails. . ■ . 

A conditional instruction causes the processor to perform a condition test operation which is a 
non-arithmetic auxiliary operation. As in normal arithmetic operations described previously, there is a 
65 proper order for writing an assembly language instruction for conditional operations. The following is 65 
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presented in the proper order. 

(1 ) A choice of destination is made. The word to be written to the destination is chosen from either the 
rounded output word w or the data word y. The chosen word can be written into the random access memory 
or into the input/output circuit. The specific destination of the selected word is given. 
5 (2) Specify the condition to be tested and the processor operation to be performed if the test is 5 
successful. 

The following table summarizes the conditional instructions which are formed by choosing one statement 
from each of two columns. 

10 TABLE II 10 

Conditional Instructions 

NOTHING if (CONDITION) dosetO 

15 15 

DEST = YSRC if (CONDITION) doau() 

DEST = w if (CONDITION) dowtO 

20 20 
Meanings for DEST and YSRC are the same as those applicable in Table 1 . The term CONDITION should be 
replaced by one of the following: 

25 CONDITION Description 25 

a==0 Accumulator contents a equals zero. 

a > 0 Accumulator contents a is greater than zero. 

30 - 30 

a < 0 Accumulator contents % is less than zero- 

Each conditional instruction is assembled as a 1 6-bit opcode word followed by a 1 6-bit argument. The 
format for a conditional instruction is 
35 35 



40 where c, s and f are control fields, as in the normal arithmetic instructions. Fields s and r have the same 40 
meaning. Control field c provides control information for the conditional operation. That information - 
includes what operation is to be performed together with the condition to be tested. 

There are three choices of operations provided. A processor address or increment register is set if the 
specified condition is true. The next arithmetic section operation is performed if the condition is true. The 
45 next write operation is performed if the condition is true. For every conditional instruction, the mentioned 45 
operations do not occur if the condition is false. 

Operations which are subject to the condition test are the operations specified in the instruction next 
following the conditional instruction in the pipeline. 

Each conditional instruction processed by the digital signal processor is fetched from the read only 
50 memory 1 00 and is transferred over the data bus to the instruction registers. The control fields s and t are 50 
stored in the instruction register IR-S,T as previously described. Control field c is stored in instruction 
register I R-C. 

By reference to Figure 8, the operation of the arrangement of Figures 1 and 2 will be described performing 
a conditional operation. Most of the conditional operation occurs much like a routine normal operation. 
55 Therefore emphasis will be placed upon those portions of the operation which differfrom a routine normal 55 
operation. The reader is referred back to the preceding descriptive material for completeness. 

In Figure 8, there is shown a conditional instruction h (c,s,t) that is fetched during precessor cycle / and is 
decoded in processor cycle i+1. In this example the conditional instruction l ; (c,s,t) is positioned in the 
pipeline to affect a normal arithmetic instruction l^, (I, m,n,s,t) which is fetched during processor cycle i + 1, 
60 decoded during processor cycle i+2 and is conditionally executed during processor cycle i+3. 60 
Fields s and t of instruction lj(c,s,t) control data fetches and a write operation during the interval i+2. The 
state u i+2 of the registers in the address arithmetic section is updated during interval i+2 as a function of the 
control fields Ms.tjand of the prior state u^ of those registers. A write memory operation Mi. 2 that occurs 
during cycle i+2 is similar to the operation previously described with respect to the normal arithmetic 
65 instruction. Because the conditional instruction is an auxiliary instruction, the arithmetic section 110 is idled 65 
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during interval i-=-2 which is the usual execution cycle for this instruction. Therefore multiplier register P, 
accumulator register A and rounded output register W retain their respective data from the last prior cycle. 
Intermediate operand Pi . 2 equals p,*„ resultant a i+2 equals a i+1 and the rounded output word w i+2 equals 

5 Control field Mc) which is stored in the instruction register IR-C during interval / includes one part 5 
identifying what condition is to be tested and a second part identifying what operation is to be controlled in 
dependence upon the test outcome of the condition. During interval i+ 1 , the two parts of the control field l s (c) 
are decoded in circuits 21 1 and 212 and are stored in registers 213 and 214. 
During interval i+2, the first decoded part of control field l,(c) that is stored in register 21 3 is applied to a 
10 comparator 215 establishing what condition is being tested. Simultaneously, the status of conditions, or io 
flags, V i+2 from the arithmetic section control 1 14 are applied by way of a path 225 to the comparator 215. 
Thus the status of the conditions of the arithmetic section are tested. Comparator 21 5 produces a condition 
true or a condition false signal on lead 221 through which the resulting signal is applied as a conditional 
^control on logic circuit 122. _ioo-« 
15 Also during interval i+2, the logic circuit 122 operating under control of the conditional signal on lead 221 15 
produces further control signals that are dispersed to the circuit DECODE F 1 13, the address arithmetic 
* section control 154, the random access memory 105 and the input/output circuit 200. The result of the 
conditional operation is held at the output of the logic circuit 1 22 for controlling the various sections of the 
processor during interval i+3. '. . , . , , tV - 

20 . Normal arithmetic instruction l i+1 (l,m,n,s,t>, which is to be affected by the conditional instruction Ucs,t), is 20 
fetched during interval i+1 and is decoded during the cycle i+2. Without the preceding conditional 
instruction this instruction would control the processor during interval i+3. Data fetches for interval t+3 

occur as usual. . 
Thereafter during interval i+3, operations executed depend upon the usual operands together with the 

25 state of the control lines from the log ic circuit 1 22, which are conditioned upon the outcome of the 25 
comparison made during interval i+2. 

When the conditional instruction Mcs,t) is a conditional arithmetic section execute, only arithmetic unit 
operations are conditionally executed during the interval i+3. Writing memory is not inhibited at this t.me. If 
the condition is true for the conditional arithmetic section execute instruction, a new product p t+3 . resultant 

30 word a i+3 and rounded output word w i+3 are produced. If the condition is false, control of the arithmetic 30 
section is disabled and no new product, resultant word or rounded output word is formed. The registers P, A 
and W retain values from the last prior interval. All other normal processor operations occur during interval 

' + lf the conditional instruction I, (c,s,t> is a conditional write instruction, only the memory write and output 
35 write operations are affected during interval i+3. Operations of the arithmetic section are not inhibited. If the 35 
condition is true for the conditional write instruction, the memory write operation M r+3 or output write 
operation occurs. If the condition is false, control of the write operation is disabled and the memory retains 
its state M i+2 from the prior interval. Writing to memory or output is controlled by the control field l i# as 
discussed previously for normal instructions. Whether the condition is true or false, all other processor 
40 operations occur normally during interval i+3. If the conditional instruction l,(c.s,t) is a conditional reg.ster 40 
set instruction, only a register set operation is affected during interval i+3. Note that in this case, since the 
register set instruction is an auxiliary instruction, there will be no activity in the arithmetic section. Memory 
or output write may proceed without interference as specified by control field Ms). If the condition is true, the 
" register designated by the register select field of the fixed data word, associated with the register set 
45 instruction, is loaded with the value in the value field of that data word, as explained previously. If the 45 
- condition is false, control of the register set operation is disabled and the register contents are not changed 
by the register set instruction. . 
The foregoing is a description of the arrangement and operation of an embodiment of the invention. 1 ne 
. scope of the invention is considered to include that embodiment together with others obvious to those 
50 skilled in the art. 50 

CLAIMS 

1 A pipelined digital processor having a source (100 or 105) providing a stream of instruction words for 
55 controlling routine processing operations and providing a stream of data words; an arithmetic section (110) 55 
for processing one data word with another data word through selected processing subsections < 1 1 2, 1 1 5, 
1 1 6) performing operations represented by an expression, thereby producing a resultant data word; a 
destination section (105) for receiving the resultant data word from the arithmetic section; the pipelined 
digital processor BEING CHARACTERIZED BY • 
v 60 control means (IR-C) for decoding a single conditional instruction word for controlling performance of a 60 
specific condition test during a first subsequent processor cycle (i.e., i+2); 

the control means further for decoding another instruction word during the first subsequent processor 
cycle for controlling some processing section operations during a second subsequent processor cycle 

(i.e., i+3); and . . . L • , 

65 means (21 5 and 122) responsive to a comparison between the conditions existing in the digital processor 65 
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during the first subsequent processor cycle and the specific condition information included in the conditional 

instruction for selectively disabling control of at least a part of a section of the digital processor during the 

second subsequent processor cycle. 
2. A pipelined digital processor in accordance with claim 1 wherein the pipelined digital processor is 
5 FURTHER CHARACTERIZED BY 5 
the single conditional instruction being a conditional arithmetic section execute instruction, and 
the responsive means enabling control of the arithmetic section during the second subsequent processor 

cycle if the condition is true and disabling control of the arithmetic section during the second subsequent 

processor if the condition is false. 

10 3. A pipelined digital processor in accordance with claim 1 wherein the pipelined digital processor is -jo 
FURTHER CHARACTERIZED BY 
the single conditional instruction being a conditional write instruction, and 

the responsive means enabling control of writing the destination section during the second subsequent 
processor cycle if the condition is true and disabling control of writing the destination section during the 

1 5 second subsequent processor cycle if the condition is false. 1 5 

4. A pipelined digital processor in accordance with claim 1 , wherein the pipelined digital processor is 
FURTHER CHARACTERIZED BY 

the single conditional instruction being a conditional register set instruction, and 
the responsive means enabling control of setting a register section during the second subsequent 
20 processor cycle if the condition is true and disabling control of setting the register section during the second • 20 
subsequent processor cycle if the condition is false. 

5. A pipelined digital processor operating in response to a plurality of control fields in each opcode word 
of a sequence of opcode words, each conditional opcode word being designated l;(c,s,t) and each 
unconditional opcode word being designated l j+1 (l,m,..s,t) where i = 0,1,2..., I,(c} is a conditional control field, 

25 I; (s,t) are control fields, h+iOJ is a first normal control field and lj +1 (m) is a second normal control field, each 25 
normal control field including information for determining a step in processing a selected expression of an 
operand y i+2 ; means (IR-C, IR-S,T, 211,21 2, 21 3, 21 4, 215, 1 22 and DECODE F) for decoding a conditional 
opcode word li(c,s,t) during a first interval (e.g., i + 1 =2) and a normal opcode word l 2 (l,m,..s,t) during a 
second interval (e.g., i+2=3); means for fetching and storing an operand y 3 during the second interval, the 

30 pipelined digital processor BEING CHARACTERIZED BY 30 
at least one processor section (e.g., 110) responsive to some of the decoded fields ! 2 (i,rn,..s,t/ during 3 third 
intervalfor processing the operand y 3 during the third interval if the condition is met and for omitting 
processing the operand y 3 during the third interval if the condition is not met. 

6. A pipelined digital processor operating in response to a plurality of control fields in each opcode word 

35 of a sequence of opcode words, each conditional opcode word being designated l;(c,s,t) and each auxiliary 35 
opcode word being designated l^fc^t) where i = 0,1,2..., I f (c) is a conditional control field, l ; (s,t) are control 
fields l,+i(c) is an auxiliary control field including information for setting a processor register; means (IR-C, 
IR-S,T, 211, 212, 213,214, 215, 122 and 154) for decoding a conditional opcode word Mc,s,t) during a first 
interval (e.g., i+ 1 = 2) and an auxiliary opcode word l 2 (c,s,t) during a second interval (e.g., i + 2-3); means for 

40 fetching and storing a register control field (XSR, XSL) during the second interval, the pipelined digital 40 
processor BEING CHARACTERIZED BY 

at least one processor section (e.g., 1 1 0) responsive to some of the decoded fields l 2 (c,s,t) during a third 
interval for setting the register during the third interval if the condition is met and for disabling control of 
setting the register during the third interval if the condition is not met. 

45 7. A pipelined digital processor operating in response to a plurality of control fields in each opcode word 45 
of a sequence of opcode words, each conditional opcode word being designated Uc,s,t) and each normal * 
opcode word being designated I^IUnv.s.t) where i = 0,1,2..., Ij(c) is a conditional control field, Ij(s.t) are 
control fields, l^fs) is a normal control field including information for identifying a destination for the 
processing result w 3 ;means (IR-C, IR-S, T 21 1, 212, 213, 214, 21 5, 1 22 and 221 > for decoding a conditional 

50 - opcode word l 1 {c,s,t) during a first interval (e.g., i+1 =2) and a normal opcode word l 2 (l,m,..s,t) during a 50 
second interval ((e.g., i+2=3); means for transferring and writing a result w 3 during a third interval, the 
pipelined digital processor BEING CHARACTERIZED BY 

at least one processor section (e.g., 105) responsive to the decoded normal control field l 2 during the third 
interval for writing the result w 3 during the third interval if the condition is met and for disabling the control 

55 of the writing of the result w 3 during the third interval if the condition is not met. 55 
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